Clustering of spectra and fractals of regular graphs
We exhibit a characteristic structure of the class of all regular graphs of
degree d that stems from the spectra of their adjacency matrices. The structure
has a fractal threadlike appearance. Points with coordinates given by the mean
and variance of the exponentials of graph eigenvalues cluster around a line
segment that we call a filar. Zooming-in reveals that this cluster splits into
smaller segments (filars) labeled by the number of triangles in graphs. Further
zooming-in shows that the smaller filars split into subfilars labeled by the
number of quadrangles in graphs, etc. We call this fractal structure,
discovered in a numerical experiment, a multifilar structure. We also provide a
mathematical explanation of this phenomenon based on the Ihara-Selberg trace
formula, and compute the coordinates and slopes of all filars in terms of
Bessel functions of the first kind.
Comment: 10 pages, 5 figures
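As a small illustration (not taken from the paper), the plotted point for a single graph can be computed directly from its spectrum. The sketch below uses the cycle C_n, a 2-regular graph whose adjacency spectrum is known in closed form; all function names are invented for this example:

```python
import math

# Adjacency eigenvalues of the cycle C_n (a 2-regular graph) are
# 2*cos(2*pi*k/n) for k = 0..n-1.
def cycle_spectrum(n):
    return [2 * math.cos(2 * math.pi * k / n) for k in range(n)]

def filar_point(eigenvalues):
    """Mean and variance of exp(eigenvalue) -- one point per graph."""
    exps = [math.exp(lam) for lam in eigenvalues]
    mean = sum(exps) / len(exps)
    var = sum((x - mean) ** 2 for x in exps) / len(exps)
    return mean, var

x, y = filar_point(cycle_spectrum(64))
# As n grows, the mean approaches the modified Bessel value I_0(2) ~ 2.2796,
# consistent with the Bessel-function description of filar coordinates.
```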
Discounting in Games across Time Scales
We introduce two-level discounted games played by two players on a
perfect-information stochastic game graph. The upper level game is a discounted
game and the lower level game is an undiscounted reachability game. Two-level
games model hierarchical and sequential decision making under uncertainty
across different time scales. We show the existence of pure memoryless optimal
strategies for both players and an ordered field property for such games. We
show that if there is only one player (Markov decision processes), then the
values can be computed in polynomial time. It follows that whether the value of
a player is equal to a given rational constant in two-level discounted games
can be decided in NP ∩ coNP. We also give an alternative strategy improvement
algorithm to compute the values.
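The discounted component of such games can be illustrated with ordinary one-level discounted value iteration on an MDP. The toy model below (state space, actions, and rewards are invented for illustration) is a minimal sketch, not the paper's two-level construction:

```python
# Toy MDP: transitions[s][a] = [(prob, next_state), ...];
# rewards[s][a] = immediate reward; beta in (0, 1) is the discount factor.
def discounted_value_iteration(transitions, rewards, beta, iters=200):
    V = {s: 0.0 for s in transitions}
    for _ in range(iters):
        # The comprehension reads the previous V before reassignment.
        V = {s: max(rewards[s][a]
                    + beta * sum(p * V[t] for p, t in transitions[s][a])
                    for a in transitions[s])
             for s in transitions}
    return V

# State 1 is absorbing; in state 0, 'stay' earns 1 forever, 'quit' earns 3 once.
transitions = {0: {"stay": [(1.0, 0)], "quit": [(1.0, 1)]},
               1: {"stay": [(1.0, 1)]}}
rewards = {0: {"stay": 1.0, "quit": 3.0}, 1: {"stay": 0.0}}
V = discounted_value_iteration(transitions, rewards, beta=0.5)
# With beta = 0.5, staying forever is worth 1/(1-0.5) = 2, so 'quit' (worth 3)
# is optimal and V[0] = 3.
```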
Tropical polyhedra are equivalent to mean payoff games
We show that several decision problems originating from max-plus or tropical
convexity are equivalent to zero-sum two player game problems. In particular,
we set up an equivalence between the external representation of tropical convex
sets and zero-sum stochastic games, in which tropical polyhedra correspond to
deterministic games with finite action spaces. Then, we show that the winning
initial positions can be determined from the associated tropical polyhedron. We
obtain as a corollary a game theoretical proof of the fact that the tropical
rank of a matrix, defined as the maximal size of a submatrix for which the
optimal assignment problem has a unique solution, coincides with the maximal
number of rows (or columns) of the matrix which are linearly independent in the
tropical sense. Our proofs rely on techniques from non-linear Perron-Frobenius
theory.
Comment: 28 pages, 5 figures; v2: updated references, added background
material and illustrations; v3: minor improvements, references updated
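For readers unfamiliar with the max-plus setting, here is a minimal sketch of tropical arithmetic (the example matrices are invented): tropical addition is max, tropical multiplication is ordinary +, and -inf plays the role of the tropical zero:

```python
NEG_INF = float("-inf")  # the tropical zero element

def tropical_matmul(A, B):
    """Tropical (max-plus) product: (A (x) B)[i][j] = max_k (A[i][k] + B[k][j])."""
    n, m, p = len(A), len(B), len(B[0])
    return [[max(A[i][k] + B[k][j] for k in range(m)) for j in range(p)]
            for i in range(n)]

A = [[0, 3], [NEG_INF, 1]]
B = [[2, NEG_INF], [0, 4]]
C = tropical_matmul(A, B)
# e.g. C[0][1] = max(0 + (-inf), 3 + 4) = 7
```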
Non-Zero Sum Games for Reactive Synthesis
In this invited contribution, we summarize new solution concepts useful for
the synthesis of reactive systems that we have introduced in several recent
publications. These solution concepts are developed in the context of non-zero
sum games played on graphs. They are part of the contributions obtained in the
inVEST project funded by the European Research Council.
Comment: LATA'16 invited paper
Probabilistic Model Checking for Energy Analysis in Software Product Lines
In a software product line (SPL), a collection of software products is
defined by their commonalities in terms of features rather than explicitly
specifying all products one-by-one. Several verification techniques were
adapted to establish temporal properties of SPLs. Symbolic and family-based
model checking have been proven to be successful for tackling the combinatorial
blow-up arising when reasoning about several feature combinations. However,
most formal verification approaches for SPLs presented in the literature focus
on static SPLs, where the features of a product are fixed and cannot be changed
at runtime. This is in contrast to dynamic SPLs, which allow the feature
combinations of a product to be adapted after deployment. The main
contribution of the paper is a compositional modeling framework for dynamic
SPLs, which supports probabilistic and nondeterministic choices and allows for
quantitative analysis. We specify runtime feature changes within an
automata-based coordination component, which enables reasoning about
strategies for triggering dynamic feature changes so as to optimize various
quantitative objectives, e.g., energy or monetary costs and reliability. For
our framework
there is a natural and conceptually simple translation into the input language
of the prominent probabilistic model checker PRISM. This facilitates the
application of PRISM's powerful symbolic engine to the operational behavior of
dynamic SPLs and their family-based analysis against various quantitative
queries. We demonstrate the feasibility of our approach with a case study of an
energy-aware bonding network device.
Comment: 14 pages, 11 figures
The Complexity of Nash Equilibria in Simple Stochastic Multiplayer Games
We analyse the computational complexity of finding Nash equilibria in simple
stochastic multiplayer games. We show that restricting the search space to
equilibria whose payoffs fall into a certain interval may lead to
undecidability. In particular, we prove that the following problem is
undecidable: Given a game G, does there exist a pure-strategy Nash equilibrium
of G where player 0 wins with probability 1? Moreover, this problem remains
undecidable if it is restricted to strategies with (unbounded) finite memory.
However, if mixed strategies are allowed, decidability remains an open problem.
One way to obtain a provably decidable variant of the problem is restricting
the strategies to be positional or stationary. For the complexity of these two
problems, we obtain a common lower bound of NP and upper bounds of NP and
PSPACE, respectively.
Comment: 23 pages; revised version
Value Iteration for Long-run Average Reward in Markov Decision Processes
Markov decision processes (MDPs) are standard models for probabilistic
systems with non-deterministic behaviours. Long-run average rewards provide a
mathematically elegant formalism for expressing long term performance. Value
iteration (VI) is one of the simplest and most efficient algorithmic approaches
to MDPs with other objectives, such as reachability. Unfortunately,
a naive extension of VI does not work for MDPs with long-run average rewards,
as there is no known stopping criterion. In this work our contributions are
threefold. (1) We refute a conjecture related to stopping criteria for MDPs
with long-run average rewards. (2) We present two practical algorithms for MDPs
with long-run average rewards based on VI. First, we show that a combination of
applying VI locally for each maximal end-component (MEC) and VI for
reachability objectives can provide approximation guarantees. Second, extending
the above approach with a simulation-guided on-demand variant of VI, we present
an anytime algorithm that is able to deal with very large models. (3) Finally,
we present experimental results showing that our methods significantly
outperform the standard approaches on several benchmarks.
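As a point of contrast with the nondeterministic setting above: once a strategy is fixed, an MDP degenerates to a Markov chain, whose long-run average reward follows directly from its stationary distribution. A minimal sketch on a toy, invented two-state chain:

```python
def stationary_distribution(P, iters=10_000):
    """Power iteration for the stationary distribution of a row-stochastic matrix."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    return pi

# Toy irreducible chain: reward 1 in state 0, reward 0 in state 1.
P = [[0.9, 0.1], [0.5, 0.5]]
r = [1.0, 0.0]
pi = stationary_distribution(P)
gain = sum(p * ri for p, ri in zip(pi, r))
# Balance equations give pi = (5/6, 1/6), so the long-run average reward is 5/6.
```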
Simulation-Based Graph Similarity
We present symmetric and asymmetric similarity measures for labeled directed rooted graphs that are inspired by the simulation and bisimulation relations on labeled transition systems. Computation of the similarity measures has close connections to discounted Markov decision processes in the asymmetric case and to perfect-information stochastic games in the symmetric case. For the symmetric case, we also give a polynomial-time algorithm that approximates the similarity to any desired precision.
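The simulation relation that inspires these measures can itself be computed as a greatest fixed point. The sketch below (toy labeled transition system, all names invented) computes the ordinary Boolean simulation preorder, of which the paper's measures are quantitative relaxations:

```python
def simulation_preorder(states, label, succ):
    """Greatest fixed point: (u, v) survives iff label[u] == label[v] and
    every successor of u is simulated by some successor of v."""
    R = {(u, v) for u in states for v in states if label[u] == label[v]}
    changed = True
    while changed:
        changed = False
        for (u, v) in list(R):
            if not all(any((u2, v2) in R for v2 in succ[v]) for u2 in succ[u]):
                R.discard((u, v))
                changed = True
    return R

# Toy LTS: a -> b; c -> d and c -> e. Labels: a,c share 'x'; b,d share 'y'.
label = {"a": "x", "b": "y", "c": "x", "d": "y", "e": "z"}
succ = {"a": ["b"], "b": [], "c": ["d", "e"], "d": [], "e": []}
R = simulation_preorder(list(label), label, succ)
# c simulates a (the move a -> b is matched by c -> d), but a does not
# simulate c: a has no successor matching the 'z'-labeled state e.
```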
Synchronizing Objectives for Markov Decision Processes
We introduce synchronizing objectives for Markov decision processes (MDP).
Intuitively, a synchronizing objective requires that eventually, at every step
there is a state which concentrates almost all the probability mass. In
particular, it implies that the probabilistic system behaves in the long run
like a deterministic system: eventually, the current state of the MDP can be
identified with almost certainty.
We study the problem of deciding the existence of a strategy to enforce a
synchronizing objective in MDPs. We show that the problem is decidable for
general strategies, as well as for blind strategies where the player cannot
observe the current state of the MDP. We also show that pure strategies are
sufficient, but memory may be necessary.
Comment: In Proceedings iWIGP 2011, arXiv:1102.374
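To make the "almost all probability mass in one state" idea concrete, the sketch below (toy chain, invented for illustration) pushes an initial distribution through a Markov chain, i.e., an MDP whose strategy is already fixed, and tracks the largest single-state mass at each step:

```python
def max_mass_trajectory(P, dist, steps):
    """Evolve a distribution under row-stochastic P; record the largest
    single-state probability mass after each step."""
    n = len(P)
    out = []
    for _ in range(steps):
        dist = [sum(dist[i] * P[i][j] for i in range(n)) for j in range(n)]
        out.append(max(dist))
    return out

# Toy chain: state 1 is absorbing and state 0 leaks into it, so the mass
# concentrates on state 1 -- the chain synchronizes in the sense above.
P = [[0.5, 0.5], [0.0, 1.0]]
masses = max_mass_trajectory(P, [1.0, 0.0], steps=20)
# After n steps the mass in state 1 is 1 - 0.5**n, approaching 1.
```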